Dynamics of Learning Near Singularities in Layered Networks

نویسندگان

  • Haikun Wei
  • Jun Zhang
  • Florent Cousseau
  • Tomoko Ozeki
  • Shun-ichi Amari
چکیده

We explicitly analyze the trajectories of learning near singularities in hierarchical networks, such as multilayer perceptrons and radial basis function networks, which include permutation symmetry of hidden nodes, and show their general properties. Such symmetry induces singularities in their parameter space, where the Fisher information matrix degenerates and odd learning behaviors, especially the existence of plateaus in gradient descent learning, arise due to the geometric structure of singularity. We plot dynamic vector fields to demonstrate the universal trajectories of learning near singularities. The singularity induces two types of plateaus, the on-singularity plateau and the near-singularity plateau, depending on the stability of the singularity and the initial parameters of learning. The results presented in this letter are universally applicable to a wide class of hierarchical models. Detailed stability analysis of the dynamics of learning in radial basis function networks and multilayer perceptrons will be presented in separate work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Dynamics of the Complex-Valued Neural Network in the Neighborhood of Singular Points

In this paper, the singularity and its effect on learning dynamics in the complex-valued neural network are elucidated. It has learned that the linear combination structure in the updating rule of the complex-valued neural network increases the speed of moving away from the singular points, and the complex-valued neural network cannot be easily influenced by the singular points, whereas the lea...

متن کامل

بررسی شکافتگی کوچک و دیگر تکینگی‌های آینده عالم و برقراری قانون دوم ترمودینامیک در نظریه

The future singularities are possible in a universe that is described by F(R) theory. In previous studies the occurrence of the singularities in F(R) theory have been considered by using a special function for the Hubble parameter and calculating the F(R) function for each of the singularities. Using the specified Hubble parameter causes some difficulties in the study of the second law of therm...

متن کامل

Organizational Learning and Knowledge Spillover in Innovation Networks: Agent-Based Approach (Extending SKIN Framework)

In knowledge-based economy, knowledge has a public good and non-rivalry nature. Firms build their own knowledge stock not only by means of internal R&D and collaboration with partners, but also by means of previously spilled over knowledge of other firms and public research laboratories (such as universities). Firms based on their absorptive capacity, and level of intra-industry and extra-indus...

متن کامل

Pattern reconstruction and sequence processing in feed-forward layered neural networks near saturation.

The dynamics and the stationary states for the competition between pattern reconstruction and asymmetric sequence processing are studied here in an exactly solvable feed-forward layered neural network model of binary units and patterns near saturation. Earlier work by Coolen and Sherrington on a parallel dynamics far from saturation is extended here to account for finite stochastic noise due to...

متن کامل

Analytical D’Alembert Series Solution for Multi-Layered One-Dimensional Elastic Wave Propagation with the Use of General Dirichlet Series

A general initial-boundary value problem of one-dimensional transient wave propagation in a multi-layered elastic medium due to arbitrary boundary or interface excitations (either prescribed tractions or displacements) is considered. Laplace transformation technique is utilised and the Laplace transform inversion is facilitated via an unconventional method, where the expansion of complex-valued...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neural computation

دوره 20 3  شماره 

صفحات  -

تاریخ انتشار 2008